Understanding Spaghetti Models with Sequence Clustering for ProM
Abstract
The goal of process mining is to discover process models from event logs. However, for processes that are poorly structured and exhibit highly diverse behavior, existing process mining techniques generate complex models that are often difficult to understand; these are called spaghetti models. One way to make sense of such models is to divide the log into clusters and analyze smaller sets of cases. However, the noise and ad-hoc behavior present in real-world logs still pose a problem: this behavior interferes with the clustering and complicates the models of the resulting clusters, which hinders the discovery of patterns. In this paper we present an approach that aims to overcome these difficulties by extracting only the useful data and presenting it in an understandable manner. The solution has been implemented in ProM and is divided into two stages: preprocessing and sequence clustering. We illustrate the approach in a case study where it becomes possible to identify behavioral patterns even in the presence of very diverse and confusing behavior.
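The two stages named in the abstract can be illustrated with a small sketch. The abstract does not specify the filtering rules or the clustering model, so everything below is an assumption made for illustration: preprocessing drops rare events and very short traces, and clustering assigns each trace to the first-order Markov chain that gives it the highest likelihood, re-estimating the chains until the assignment stops changing. The function names (`preprocess`, `fit_chain`, `cluster`) and thresholds are hypothetical and are not the ProM plugin's API.

```python
from collections import Counter, defaultdict
import math

START, END = "<start>", "<end>"  # artificial boundary states (assumption)


def preprocess(log, min_event_freq=2, min_trace_len=2):
    """Drop events seen fewer than min_event_freq times, then drop short traces."""
    counts = Counter(event for trace in log for event in trace)
    cleaned = []
    for trace in log:
        kept = [event for event in trace if counts[event] >= min_event_freq]
        if len(kept) >= min_trace_len:
            cleaned.append(kept)
    return cleaned


def fit_chain(traces, alphabet, smoothing=1.0):
    """Estimate smoothed first-order transition probabilities for one cluster."""
    states = [START] + alphabet + [END]
    counts = defaultdict(Counter)
    for trace in traces:
        path = [START] + trace + [END]
        for a, b in zip(path, path[1:]):
            counts[a][b] += 1
    chain = {}
    for a in states[:-1]:  # END has no outgoing transitions
        total = sum(counts[a].values()) + smoothing * len(states)
        chain[a] = {b: (counts[a][b] + smoothing) / total for b in states}
    return chain


def log_likelihood(trace, chain):
    """Log-probability of a trace under a cluster's Markov chain."""
    path = [START] + trace + [END]
    return sum(math.log(chain[a][b]) for a, b in zip(path, path[1:]))


def cluster(log, k, iterations=10):
    """Hard-assignment sequence clustering; returns one cluster index per trace."""
    alphabet = sorted({event for trace in log for event in trace})
    # Deterministic round-robin start (an assumption; random starts are also common).
    assignment = [i % k for i in range(len(log))]
    for _ in range(iterations):
        chains = [
            fit_chain([t for t, c in zip(log, assignment) if c == j], alphabet)
            for j in range(k)
        ]
        new_assignment = [
            max(range(k), key=lambda j: log_likelihood(trace, chains[j]))
            for trace in log
        ]
        if new_assignment == assignment:
            break
        assignment = new_assignment
    return assignment
```

Calling `cluster(preprocess(log), k)` on a list of traces (each a list of activity names) returns a cluster index per retained trace. Filtering before clustering reflects the abstract's point that noise and ad-hoc behavior otherwise distort the resulting cluster models.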
Publication date: 2009